Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning

نویسندگان

  • Tianmin Shu
  • Caiming Xiong
  • Richard Socher
چکیده

Learning policies for complex tasks that require multiple different skills is a major challenge in reinforcement learning (RL). It is also a requirement for its deployment in real-world scenarios. This paper proposes a novel framework for efficient multi-task reinforcement learning. Our framework trains agents to employ hierarchical policies that decide when to use a previously learned policy and when to learn a new skill. This enables agents to continually acquire new skills during different stages of training. Each learned task corresponds to a human language description. Because agents can only access previously learned skills through these descriptions, the agent can always provide a human-interpretable description of its choices. In order to help the agent learn the complex temporal dependencies necessary for the hierarchical policy, we provide it with a stochastic temporal grammar that modulates when to rely on previously learned skills and when to execute new skills. We validate our approach on Minecraft games designed to explicitly test the ability to reuse previously learned skills while simultaneously learning new skills.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sition in Multi-task Reinforcement Learning

Learning policies for complex tasks that require multiple different skills is a major challenge in reinforcement learning (RL). It is also a requirement for its deployment in real-world scenarios. This paper proposes a novel framework for efficient multi-task reinforcement learning. Our framework trains agents to employ hierarchical policies that decide when to use a previously learned policy a...

متن کامل

The Effect of Pairwise Video Feedback on the Learning of Elegant Eye-Hand Coordination Skill

The present paper aimed to study the effect of pairwise video check feedback (including the observation of external pattern of skill performance and performing the skill simultaneously) on the learning on acquisition and learning of eye-hand coordination skill. Computer skill of eye-hand coordination skill was the tool used in this study. 24 subjects were randomly selected and equally divided...

متن کامل

Stochastic reinforcement benefits skill acquisition.

Learning complex skills is driven by reinforcement, which facilitates both online within-session gains and retention of the acquired skills. Yet, in ecologically relevant situations, skills are often acquired when mapping between actions and rewarding outcomes is unknown to the learning agent, resulting in reinforcement schedules of a stochastic nature. Here we trained subjects on a visuomotor ...

متن کامل

Toward the Autonomous Acquisition of Robot Skill

The design and coordination of independent specialized skill units (often called action primitives) is fundamental to modern robotics. However, a robot that must act in a complex environment over an extended period of time should do more than just use existing skills: it should learn new skills that increase its capabilities and facilitate later problem solving. Although robots exist that can l...

متن کامل

Active Learning of Parameterized Skills

We introduce a method for actively learning parameterized skills. Parameterized skills are flexible behaviors that can solve any task drawn from a distribution of parameterized reinforcement learning problems. Approaches to learning such skills have been proposed, but limited attention has been given to identifying which training tasks allow for rapid skill acquisition. We construct a non-param...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1712.07294  شماره 

صفحات  -

تاریخ انتشار 2017